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in a Point-Foot Series Elastic Biped: 
Balance on Split Terrain and Undirected Walking 
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Abstract —In this paper we present advancements in control 
and trajectory generation for agile behavior in bipedal robots. 
We demonstrate that Whole-Body Operational Space Control 
(WBOSC), developed a few years ago, is well suited for achieving 
two types of agile behaviors, namely, balancing on a high pitch 
split terrain and achieving undirected walking on flat terrain. 
The work presented here is the first implementation of WBOSC 
on a biped robot, and more specifically a biped robot with 
series elastic actuators. We present and analyze a new algorithm 
that dynamically balances point foot robots by choosing footstep 
placements. Dealing with the naturally unstable dynamics of these 
type of systems is a difficult problem that requires both the 
controller and the trajectory generation algorithm to operate 
quickly and efficiently. We put forth a comprehensive develop¬ 
ment and Integration effort: the design and construction of the 
biped system and experimental infrastructure, a customization of 
WBOSC for the agile behaviors, and new trajectory generation 
algorithms. Using this custom built controller, we conduct, for 
first time, an experiment in which a biped robot balances in a 
high pitch split terrain, demonstrating our ability to precisely 
regulate Internal forces using force sensing feedback techniques. 
Finally, we demonstrate the stabilizing capabilities of our online 
trajectory generation algorithm in the physics-based simulator 
and through physical experiments with a planarized locomotion 
setup. 


I. Introduction 

Controlling contact with the external world is critical for 
smooth operation of bipedal robots in cluttered environments, 
quickly climbing rough surfaces, and safely colliding with 
objects. Addressing these goals necessitates that we understand 
the hardware and computational requirements of general agile 
behaviors and that we validate such behavior in physical 
systems. The main objective of this paper is to advance agility 
by leveraging Whole-Body Operational Space Control ||T| 
(WBOSC). We focus on dual contact maneuvers in extreme 
terrain, and undirected walking. We achieve this by (1) build¬ 
ing a hardware infrastructure based on a series elastic point 
foot bipedal robot, (2) leveraging WBOSC to regulate internal 
force behavior and achieve dynamic locomotion, (3) devel¬ 
oping a new trajectory generation algorithm for undirected 
walking, (4) testing dynamic locomotion with a physics based 
simulation, and (5) conducting various experiments on agility. 

Whole-body Operational Space Control is a framework 
which returns the joint torques consistent with a desired set 
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of operational space accelerations, known contact constraints, 
and desired internal forces. The internal forces, during multi¬ 
contact, correspond to the linear subspace of joint torques that 
do not cause accelerations of the robot. The basic contact- 
consistent whole-body operational space control structures 
were laid out in Q and then extended in |[T) but only 
demonstrated in simulation. This differs from the work of 0, 
a related strategy using torque controllers to optimize the dis¬ 
tribution of reaction forces, in that WBOSC considers internal 
forces explicitly, and separately from the operational space 
acceleration goals, and places them under feedback control. 
This feature also separates WBOSC from the controller of Q, 
which acknowledges the internal forces but specifies reaction 
forces and does not impose a feedback law on the internal 
component. Implementation of a whole-body controller for 
quadrupedal robots with optimized distribution of the reaction 
forces is described in Q. Hardware experiments on a series 
elastic actuated quadruped robot with actuated ankles using 
quadratic programming to minimize tangential reaction forces 
are shown in 0. Approaching the problem from a plan¬ 
ning perspective, 0 explores advanced optimization methods 
to solve the multi-objective contact dynamics of graphical 
avatars. Recently this team has begun to test quasi static 
contact tasks in small size humanoid robots. The above is just 
a short list of whole-body controllers of similar type. There is 
an abundance of whole-body task controllers for legged robots 
that we shall review further below. In view of the available 
whole-body controllers, the work presented here is the first 
to implement a whole-body operational space controller on a 
point foot biped robot, it is the first to show biped balancing 
on high pitch split terrains, and is the first to use whole-body 
operational space control for biped dynamic locomotion. 

In essence, the main contributions of this study are: (1) 
creating a bipedal robot infrastructure based on whole-body 
operational space control, (2) incorporating sensor-based feed¬ 
back controllers for internal force regulation during balancing, 
(3) introducing a new online trajectory generation algorithm 
for undirected walking, and (4) assessing the performance of 
whole-body operational space control for balancing on a high 
pitch split terrain and for undirected walking. 

II. Related Work 
A. Control of Robots with Point Feet 

Point-foot biped robots similar to ours have been widely 
used for dynamic locomotion ®-II3 due to their mechanical 
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simplicity. Point-foot robots have the potential to accomplish 
more agile swing trajectories without the weight of an ankle. 
Point feet, however, are more difficult to control than robots 
with powered ankles since they lack the ability to apply a 
torque to the ground. Few have managed to walk upright 
without the help of a constraint mechanism, the two most 
notable examples being the hydraulically actuated hopper from 
0 and the biped from m- This fundamental difficulty can 
still be meaningfully addressed in a planar case, and the vast 
majority of research on point foot biped robot locomotion 
focuses on robots constrained to a plane. 

Similar to GD our robot is electrically powered and has 
six actuated leg joints. In terms of servo rates, point foot 
biped systems are especially unforgiving, since it is difficult 
to reduce the fundamental time constant for falling over. This 
makes sensing systems for point foot bipeds different from 
those of robots with actuated ankles, such as Atlas H), 
Valkyrie and Sarcos | [T8| in that they must be focused on 
fast maneuvers. To accommodate their fast dynamics, special 
care must be taken to ensure that the control systems for these 
robots operate at high frequency and that motion is detected 
at high speed. Relative to the infrastructure of other point foot 
robots such as 0 and 0, our setup includes an additional 
overhead motion capture system, which communicates with 
the control computer to allow absolute position feedback. We 
do not have sensors on the planarizer’s joints or slider. 

B. SEA Control 

Series Elastic Actuators (SEAs) are designed with an elastic 
element and provide two important benehts over their rigid, 
or directly connected, counterparts; they offer improved force 
control accuracy 0, 0, and a lower output inertia. This 
high hdelity torque tracking is critical to contact scenarios 
where the internal multi-contact forces must be maintained 
within a certain range, as is the case when a robot balances 
on steep terrain. Some robots, for example the robots of |19| 
and 0, use SEAs only for energy storage, and do not take 
advantage of their force control capabilities. Hume, on the 
other hand uses SEAs primarily for their force control and 
inertial benehts to handle collisions. Yet the inclusion of SEAs 
comes at a cost to performance. Controllers that have been 
designed to approximate the dynamics of a perfect position 
source, a perfect torque source p0)-p2), or a second order 
mass-spring-damper system all face the same problem in their 
hnal closed loop system; the dynamics of the reference plant 
are impossible to achieve at the highest frequencies. 

The use of SEAs for biped locomotion was pioneered 
in | |2^ , which suggests a straightforward PD torque control 
strategy with some model-based feedforward terms. However, 
the work does not compensate for static errors. The same 
torque controller was used in the SEA actuated robot | [24| to 
study push recovery. However, such an application does not 
require accurate force tracking so much as position control, 
and thus it is not possible to asses the torque performance 
of these actuators. Trajectory following accuracy did appear 
to be limited by the SEA compliance; In the author’s own 
words they state; “Due to the use of SEAs with very compliant 


springs, we have had difficulty to quickly and accurately swing 
the leg,” A sentiment echoed by our own observations. Recent 
studies describe potential solutions for this type of problem 
with SEA actuators 0, 0-0, with passivity based 
controllers emerging as a solid, if conservative, approach. 0 
adds an inner motor velocity loop and incorporates integral 
torque action to the controller. More recently, | |27t compared 
the stability and performance of various active impedance 
control approaches based on cascaded SEA controllers. 


C. Whole-Body Controller Design 

Most existing experimental approaches for whole-body con¬ 
trol are based on optimization methods which were pioneered 
by 0. 0 represents the hrst implementation of full dynamic 
based task controllers with contact constraints on a humanoid 
robot. The work focuses on push recovery and basic walking. 
In | |28) the implementation of hierarchical inverse dynamics 
algorithms using quadratic program solvers is presented on 
a Sarcos biped robot. Experiments include balancing while 
withstanding external forces, balancing on a moving platform, 
and balancing on a single foot. In p9| a torque-based whole- 
body controller is presented for controlling the Atlas and 
Valkyrie humanoid robots. A quadratic program solver is used 
to minimize momentum rate objective, contact force regula¬ 
tion and task acceleration regularization. In pO) whole-body 
control with inequality constraints via inverse dynamics and a 
quadratic program solver is proposed on the humanoid robot 
HRP-2. However, the algorithm is used offline to generate 
trajectories that are then tracked by the robot. In pT) a torque- 
based whole-body controller focused on optimization of mul¬ 
ticontact wrenches over a period of time corresponding to 
center of mass (COM) movement is presented. The approach 
is, for now, presented in the context of balancing and is only 
shown in simulation. All of these works aim only to control 
humanoids with actuated ankles, and thus do not consider 
the under-actuation situations endemic to dynamic point-foot 
walking. Separately, in p^ optimization methods based on 
inverse dynamics projected on the contact null space are used 
to control the gait of a quadrupedal robot. In contrast to these 
works, ours is the hrst to implemented an operational space 
inverse dynamic algorithm in an underactuated bipedal point 
foot robot. It does not attempt to control the center of mass 
during locomotion, relying solely on estimation via prismatic 
inverted pendulum dynamics. Additionally, the previous works 
have not implemented sensor-based internal force control nor 
have they attempted to balance the robots on such extreme 
surfaces. 

In particular, between those controllers that perform task 
space inverse dynamics, there are controllers which calculate 
internal forces as a byproduct of another optimization and 
there are those that require the specihcation of the internal 
forces of contact. Methods which deal with contact forces 
as generally interested in balance 0, 0, p^ rather than 
in controlled interaction with the environment. Eor instance 
maintaining a stance between highly sloped surfaces. In their 
work the main objective is to maintain balance while keep¬ 
ing the reaction forces within friction cones so contact is 
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maintained. [3^ used the reaction force method to keep a 
quadruped on a surface with a approximately 40° inclinations. 
However, until now, no group has approached to problem of 
accurately controlling these internal forces through feedback 
control. Previous methods have fed the desired reaction forces 
into their inverse dynamics routine as a feed-forward term, and 
the error between the actual internal force and the achieved 
internal force was not used to lower this error through joint 
torque. Such feedback would potentially require significant 
modification to optimization based method of determining the 
feed-forward reaction force. 

Whole-Body Operational Space Control Q, is centered 
around the idea of achieving operational space impedance 
control and controlling the internal forces between supporting 
contacts. In particular, internal forces are chosen such that they 
are decoupled from the motion of the robot and can be directly 
specified by a high level planner. By using feedback control 
on these internal forces we can achieve higher precision force 
tracking which is less sensitive to modeling errors than it 
would if we used feed-forward internal forces. 


D. Locomotion 


A hybrid dynamical systems problem, point-foot locomotion 
in the plane is difficult because single support motion is 
under-actuated and thus the system can only be stabilized 
when the discrete effects are taken into accountQOne of the 
most successful approaches to this problem comes from 
wherein the feedback whole body controller constrains the 
dynamics to match a model which is predicted by a simu¬ 
lation to be stable in the discrete sense. Another successful 
approach based on hybrid zero dynamics is the line of work 
by p5) which utilizes human inspired trajectories to generate 
stable periodic locomotion. These formulations are designed 
to achieve periodic motions, as is also the case for other works 
based on Poincare maps GD- 

Algorithms such as the capture point | |24| and phase space 
planning approaches | |3^ can be used to stabilize the pendu¬ 
lum model by modifying footstep locations. Phase space plan¬ 
ning extends the linear pendulum model to consider the case 
where the center of mass height is a function of the horizontal 
position, the prismatic inverted pendulum model p7| . Thus 
one objective of this paper is to assess the performance of our 
whole body controller in the task of restricting the dynamics 
to match those of a prismatic inverted pendulum model. In 
this last conference paper we presented an earlier version of 
the trajectory generation algorithm considered in this paper. 
However, no control of internal forces or experiments of any 


type were conducted. As stated in 1381 there is still little work 
on aperiodic walking. Our trajectory generation algorithm 
achieves aperiodic gaits by exploiting a simple time to velocity 


reversal rule. It is in some ways close to the algorithm by |391 
but designed for balancing in 3D instead of walking in the 2D 
plane. 


* Provided we are willing to neglect the extra controllability afforded by the 
Coriolis and gravity terms. 



Fig. 1. Hume Robot Kinematics, as when attached to the planaiizing link¬ 
age. Blue schematics describe the floating base joints, while black describes 
the kinematics of the six leg joints and three planarizer joints. The planarizer 
kinematics are not included in the generalized joint vector, since they are 
not part of the robot’s model. The locations of the LED tracking markers 
identified by the PhaseSpace Impulse Motion Capture system are shown as 
red dots. 


III. System Characterization 
A. Hardware Setup 

Our robot is a teen-size humanoid robot measuring 1.5 
meters in height and 20 kg in weight. The leg kinematics 
resemble simplified human kinematics and contain an ad¬ 
duction/abduction hip joint followed by a flexion/extension 
hip joint followed by flexion/extension knee joint as shown 
in Fig. The lack of ankle joints allows the shank to be 
essentially just a lightweight carbon fiber tube. At the tip of 
the shank we have incorporated contact sensors which are 
essentially limit switches. The series elastic actuators on all 
six joints are based on a sliding spring carriage connected to 
the output by steel ropes. The deformation in the springs are 
directly measured within the carriage assembly. The concept, 
kinematics, and specifications of the robot were proposed 
by our team at UT Austin, and the design was executed in 
collaboration with Meka Robotics and manufactured by that 
company. For fall safety the robot is attached to a trolley 
system with a block and tackle system which allows easy 
lifting and locking at a height. In addition, the robot has a 
removable planarizer mechanism which constrains the motion 
of the robot to the saggital plane. The pitch of the robot 
remains unconstrained, as the planarizer connects to the robot 
through a set of bearings. Ultimately, pitch, forward motion, 
and vertical motion are allowed, while lateral motion, roll, and 
yaw are prohibited. 

From an electrical point of view the robot is controlled with 
distributed digital signal processors, connected by an EtherCat 
network to a centralized PC running a real-time RTAI Linux 
kernel. This communication system introduces a 1ms delay 
from the linux machine to the actuator DSPs and back. Each 
DSP controls a single actuator, and they do not communicate 
directly with each other. Power is delivered through a tether 
from an external source. 

A Microstrain 3DM-GX3-25-OEM inertial measurement 
unit (IMU) on the robot’s torso measures angular velocity 
and linear acceleration, which is used in the state estimator. 
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Fig. 2. Overall Control Diagram. This figure illustrates the whole body operational space control process (WBOSC) and the joint-level torque controllers. 
One of our main contributions comes from the feedback control of internal forces. Note that the gains for the controllers are treated as additional input 
parameters to represent the gain scheduling we perform in order to achieve the best possible performance with each task. 


Additionally, the robot has a full overhead PhaseSpace Impulse 
motion capture system which gives it global coordinate infor¬ 
mation about seven uniquely identifiable LED tracking devices 
mounted rigidly to the torso. The PhaseSpace system produces 
a data stream at 480Hz and communicates to the Linux Control 
PC via a custom UDP protocol. There is an approximate 
15ms delay in the data for feedback purposes. It accomplishes 
this using a system of sixteen high speed cameras mounted 
on the ceiling above the robot, and a proprietary software 
package to fuse their readings into a single estimate for the 
three dimensional position of each marker. On each update, 
the system reports the location of as many of the uniquely 
identifiable LED markers as it can see. 

B. End-to-End Controller Architecture 

The feedback control system is split into six joint level 
controllers and a centralized high level controller (see Fig. |^. 
The purpose of the joint level controllers is to achieve good 
torque tracking given the series elastic actuators. This type 
of control architecture falls into the category of a distributed 
control system which allows the joint controllers to focus on 
high speed actuator dynamics while the centralized controller 
does not need to deal with this nuance. Yet the feedback at the 
high level is necessary in order to create the coupling between 
joints implied by operational space impedance tasks as well as 
regulating the internal forces between multicontact supports. 


C. Series Elastic Actuators 

The robot came with MEKA’s pre-loaded joint controllers 
based on the passivity torque controller described in m 
(shown on the lower right corner of Fig. [^. We kept this 
controller’s structure while changing the gains of the feed¬ 
back controller to enhance the performance of the high level 
controller, and this ultimately entailed reducing the low level 
torque gains. In order to tune the torque gains we leveraged 


our findings in |25|. In this study we describe a trade-off 


between torque gains and position gains in a distributed 
control architecture. Specifically, we explore the observation 
that raising the torque controller’s proportional gain limits the 
maximum stable position gains and vice versa. To respond to 
this observation we implemented a gain scheduling strategy; 
in the joints of the stance leg we lowered the torque gains so 
we could raise the proportional gain and reduce error. In the 
joints of the swing leg we raised the torque gains to produce 
less friction dominated behavior. 


D. Whole Body Operational Space Control 

Whole-Body Operational Space Control Q is a feedback 
control strategy based on Operational Space Control pO) , 
which extends it to floating base robots in contact with 
the environment. It allows the user to specify multiple task 
objectives and their impedance in operational space. It ad¬ 
ditionally subdivides the torques applied to the robot into 
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orthogonal spaces which affect either the motion of the robot 
or the internal forces which do not. When the user specifies 
these internal forces our implementation governs them using 
feedback. Whole-Body Operational Space Control is explained 
in Appendix 

At the implementation level, WBOSC worked - provided 
that our latencies were sufficiently small. Achieving a 1 ms 
latency required some significant software work. We re-wrote 
the firmware provided by our robot manufacturer in order to 
ensure our controller operated within a real-time thread, and to 
incorporate it into a hierarchical chain structure which ensures 
minimum latency for stacked control systems. We also reduced 
the basic computational cost of our WBOSC algorithm by 
bypassing recursive dynamics software and instead using a 
closed form expression to calculate the mass matrix. In order 
to reduce the tracking error, we added an integral term to all 
position tasks, which helps alleviate the friction difficulties 
involved in lowering the torque gains at the DSP level. This 
also reduces error due to inaccuracy in the gravity estimation 
term and other steady state errors in our dynamics model. 

When operating with the robot within a planarizing linkage 
mechanism, as is the case for the experimental section of this 
paper, we still model the robot as having a floating base. 
We then incorporate the additional inertia of this planarizer 
as a lumped mass inside the floating base. Thus, in the 
equations in Appendix [A] the generalized coordinate variable 
q corresponds to the six degrees of floating base kinematics 
plus six degrees of freedom corresponding to abduction, hip, 
and knee kinematics for the two legs as shown in Fig. 
Our matrix U corresponds to floating base kinematics. We 
also added terms to the mass matrix representing the rotor 
inertia of the motors, which showed a slight improvement in 
performance. Finally, we tuned the task gains experimentally 
using heuristics. 


E. State Estimation and Sensor Fusion 

The controller needs an estimate of the body orientation 
every millisecond, yet the motion capture system updates at 
480 Hz, occasionally fails to track a subset of the markers, 
and has a non-trivial latency of 15 ms. Given that the on 
board IMU accurately reports the rotational velocity of the 
torso with respect to an inertial reference frame, we integrate 
forwards in time to maintain an estimate of the orientation 
while waiting for an update from the overhead positioning 
system. When such an update arrives, we acknowledge the 
feedback latency of the sensing system process, and generate 
an innovation measurement based on not the most current 
value of the orientation estimate, but the estimate from 15 
controller update periods into the past - i.e. the ratio between 
the 15ms latency of the motion capture system and the 1ms 
servo rate. 

Calling this time A: = f — 15, we can setup a least squares 
problem which minimizes the distance between measured LED 
position G and predicted LED position yf G for 
i = 1,n, where n is typically 7, but decreases when LEDs 
are blocked or otherwise non-visible. Our model predicts LED 
locations based on an affine transformation of a default pattern 


y^ = where G G is the 

affine transform at time k and the default pattern, Zi G K^, is 
essentially just the measurement from some default position, 
but is shifted such that the center of the coordinate system is 
the geometric center of the points. That is the first moment is 
zero for the pattern: Sj^j^CjZi = 0 for j = 0,1, 2. This affine 
transform includes both a linear bias term which represents 
translation of the geometric center of the LEDs, and a matrix 
term which represents rotation as well as non-physical skewing 
and scaling of the pattern. The pattern is specifically designed 
such that the origin of the pattern frame is the geometric center 
of the points. As this model is linear in the affine parameters 
we can solve for the case where all the LEDs are visible as 
follows 
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where Q describes affine transforms in vector form, and 
demonstrates the linearity of prediction, and ([^ defines the 
affine transform which best describes the observed LED vector 
as a transform of the pattern. However, we have opted 
to lowpass this sensor data by adding twelve extra rows of 
regularization terms and a diagonal weighting matrix. 
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to the least squares equation. In addition, we cannot always 
assume that all LEDs are visible and we must dehne a 
knockout matrix Ko which selects the LEDs which the system 
successfully located: 


Ko G 


/ej if LED 1 was foundA 
e{ if LED 2 was found 


\eg if LED 7 was found J 


(5) 


Where (ATo 0 / 3 x 3 ) selects only equations relating to observed 
LEDs from the original regressor. 
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where the integration of the IMU orientation rate data, 
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is the basis for the regularization term setpoints, as shown in 

0 - 

The regularization adds another term to the objective func¬ 
tion, specifically the squared deviation between each element 
of the affine transform rotation matrix and our estimate of this 
matrix given the orientation estimate at time k. Regularization 
is also applied to the vector component of the affine transform, 
but the weight Ai is so small as to be neglected. The weight 
A 2 , on the other hand, represents a significant initial covariance 
which specifies the tradeoff between old knowledge and new 
knowledge. The weights were chosen such that the discrete 
time half life of an error in the orientation estimate is one 
update if all seven LEDs are visible. 

Solving this least squares problem returns a transform which 
is potentially skewed and scaled as well as rotated, and thus 
is not a valid rigid body transform. By finding the closest 
quaternion to this rotation matrix we constrain the result to 
valid transforms, using the method of pTf . Finally, with the 
orientation estimate at time k being the closest quaternion to 
the affine transform, we can integrate the IMU data from times 
k to t to estimate the orientation at time t. The integration of 
IMU data continues incrementally while the algorithm waits 
for the next piece of overhead camera data to arrive, and 
then it repeats the more complex fusion algorithm. Since the 
algorithm performed adequately the first time it was used, no 
attempt was made to precisely tune the delay estimate and 
filter constants. 


F. Contact Transitions 

In order to reduce the high speed behavior associated with 
a sudden change in joint torques, we have implemented a 
strategy which acts to smooth out the torque commands when 
the robot transitions between single and dual support. The 
sudden change of torque commands is due to the instanta¬ 
neous switch between constraint sets within WBOSC, and our 
method to smooth these torque commands effectively bridges 
the difference between single contact and dual contact. To 
make WBOSC with a single contact constraint produce the 
result it would with a dual contact constraint we simply add 
a desired reaction force to the swing foot - the same reaction 
force that would be expected of this foot in the dual contact 
case. Then, to transition, we decrease this desired reaction 
force linearly with time in the case of foot lifting, or linearly 
ramp it up from zero in the case of foot landing. This requires 
that we know this reaction force beforehand, so we must 
always run the controller with dual contact constraints before 
the single contact version. When lifting the foot we can use 
the previous value of the reaction force, but when landing we 
run the controller in dual contact once at the beginning of the 
foot landing transition phase for the sole purpose of acquiring 
this reaction force. 

To implement this desired external force in single contact 
WBOSC we add the term f^xt to Equation ( [T5| ), 

f^task — “t” A^task /^task fext- (9) 

This force fext — wfext, dual can be extracted from the dual 
contact WBOSC after the output torque is calculated based on 



Triped Robot 


Biped Robot 


Virtual actuators representing Internal Forces 


Fig. 3. Illustration of Internal Forces for Various Robots. Internal forces 
in point foot robots correspond to tensions or compressions between pairs of 
supporting contacts. 


Equation (|2^ 


fext, dual — ‘5'swing s \j-^ 2"control ^ AA] “f 


( 10 ) 


where Sswing € selects the constraint forces of the swing 
leg from those of both foot contact constraints and w G [0,1] 
represents the linear ramp. 


IV. Feedback-Based Internal Force Control 

Internal force behavior corresponds to actuator forces that 
produce no net effect on the robot’s motion. A such, internal 
forces correspond to mutually canceling forces and moments 
between pairs or groups of contact branches, i.e. tensions, 
compressions and reaction moments. For instance, a triped 
point foot robot has three internal force dimensions while a 
biped point foot robot has a single internal force dimension 
as shown in Figure 

In this context, building a biped robot with excellent torque 
sensing has two advantages: (1) its ability to use low level 
torque control to overcome actuator friction and achieve 
greater control bandwidth. In turn, our rigid body assumptions 
to model internal forces are less affected by low level actuator 
dynamics; and (2) torque sensors on the robot’s joints permit 
the implementation of sensor-based internal force feedback for 
accurate tracking. 

What is interesting about taking a model-based approach 
is that internal forces are fully controllable by definition as 
they are orthogonal to the robot’s motion. As such, both the 
robot’s movement and its internal forces can be simultaneously 
controlled to feasible values. Moreover, in many types of 
contact poses, internal forces are easily identifiable using some 
physical intuition. For instance, in the triped pose of Figure 
the three feet can generate three virtual tensions between the 
points of contact. The physics of tension forces were analyzed 
in greater detail using a virtual linkage model in ||T|. 

Internal forces are part of the core functionality of Whole- 
Body Operational Space Control. In the Appendix section, we 
describe the model-based control structures enabling direct 
control of internal forces. In particular, the basic torque 
structure derived in Equation ([4T| is written here again for 
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readability, 

Tint = Ji\l ^-Fint.ref ^int,{t} + Mi + i (H) 

where Fi„t ref is the vector of desired internal forces and 
^int,{f} corresponds to the mapping of task torques into the 
internal force manifold. The above equation is based on the 
assumption that commanded torques and actual torques are 
identical, and that the kinematic and dynamic models are 
exact. Because these premises are never true, to achieve best 
results on force regulation or tracking, we propose to employ 
sensor-based feedback control on the internal forces. To our 
knowledge, this is the first time that sensor-based feedback 
control of internal forces is proposed and achieved in a real 
robot. 

Because internal forces are fully controllable, we incorpo¬ 
rate a simple proportional controller on the internal forces into 
Equation (|43|l. 


Tint — Ji\l ^^int.ref ^int,{t} “b Mi T Pi 

T F (.^int.ref -^int,act) ), (12) 

where Kp is a proportional control gain and Tint,act are the 
actual sensor-based internal forces. In order to obtain these 
sensor-based forces, we use the torque sensors on the series 
elastic actuators to find the co-states of constraint as per 
Equation ( |29l l and apply a projections Wint to find internal 
forces. 


Tint, act — bkint 




Js 


-b- g) + KJq 


(13) 


where Tsensors corresponds to the vector of torques sensed by 
the spring element in each series elastic actuator (see Eigurej^. 

Although this internal force mapping above is distinguished 
from previous work, due to its sensor-based force feedback 
control, this mapping is valid due to the physical fact of 
robot redundancy in the multi-contact case. The induced 
contact closed loop causes the number of controlled tasks 
to be smaller than that of actuated joints. Correspondingly, 
additional DOEs are available to be controlled for more tasks, 
such as internal forces in Equation ( [T3| ). This mapped internal 
force is consistent with contact constraints and cancellation of 
accelerations on the robot’s base or on the actuated joints ||Tj. 
More details can be found in the Appendix section. 

To calculate internal forces for our bipedal robot, Hume, 
we need to define the mapping given in Equation •ED in 
Appendix where Wi^t is the matrix representing the map 
from reaction forces to internal forces. In our case, Hume 
controls the internal forces between the two feet during 
dual contact phases. In dual support, the reaction forces are 
{fRx, fRy, Jrz, fhx, fLy, JlzY, where R, L mean a right 
and left foot, respectively. According to |[Tj, Wi^t consists of a 
selection matrix of tensions, St, a rotation matrix from global 
frame to the direction parallel to the line between two contact 
points, Rt, and a differential operator matrix. At, i.e. 


VEint = St Rt At, 


(14) 


with 


St = {l 0 0), (15) 

{ A _ Pr — Pl 

y = (-x(2),x(l),0)^ , (16) 

z = X X y 

At = (/3x3 —hxs) ■ (17) 

where Pp and Pp are the position of the right and left feet, 
respectively. 

In order to compute desired internal forces, we suggest 
either to use heuristics as we do in this paper, or the use of the 
virtual linkage model and the multicontact/grasp matrix, which 
were proposed in ||T]. Those models allow to relate the center 
of mass and the internal force behavior to the reaction forces. 
A study on the usage of these models is currently beyond the 
intended scope of this paper. Compared to other methods based 
on optimizing reaction forces, our method uses sensor-based 
feedback control on the internal forces to regulate or track a 
desired tension reference with good accuracy. 

V. Trajectory Generation 

In order to stay upright our robot must constantly re¬ 
position its feet, and a mechanism of considerable complexity 
is required to decide the upcoming foot position for the swing 
foot. We have developed a method for finding this position 
which we refer to as a phase space constant time to velocity 
reversal planner. In every step, when the lifting phase reaches 
80% completion, the planner runs in an online fashion and 
returns the next footstep location before the lifting phase ends 
and the landing phase begins. The operational space set-point 
trajectory for the swing foot is then defined parametrically 
based on this desired landing position, with the trajectory 
ending once ground contact is sensed. If the planned step is 
outside the mechanical limits of the robot the step saturates to 
the closest reachable step. 

The method we use to choose this footstep location operates 
on a one dimensional model appropriate for planar walking, 
but, by choosing the x and y components of the footstep 
location as solutions to the forward and lateral planar walking 
problems, it is extended to 3D walking. We will present first 
the planar algorithm used in the experimental section going 
on to explain our approach to 3D walking. 

1) ID Velocity Reversal: Our planner attempts to stabilize 
the robot by causing the center of mass to reverse direction 
every step. Central to this undertaking is the exploitation 
of a simplified model of the robot’s zero-dynamics given 
specific operational space tasks: the prismatic inverted pen¬ 
dulum model, or PIPM. The PIPM considers a point mass 
constrained to an arbitrary continuous height surface by a 
constraint wrench which evaluates to a pure force at both the 
foot point and the center of mass - that is the model assumes 
a point foot produces the reaction force and that the reaction 
force points towards the center of mass. The PIPM can be 
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Fig. 4. Constant Time to Velocity Reversal Algorithm. As shown in (a), we approximate the dynamics of the robot with the prismatic inverted pendulum, 
shown in (b). This model predicts the dynamics of the horizontal center of mass position x given the stance foot location Xp = ^ and the height surface 
2 : = h{x). This can be integrated forward in time via the numerical integration procedure shown in (c). When the planner starts operating it records the initial 
state and integrates this state forward to determine the switching state #. As shown in timelines (d) and (e), the “Estimated sequence” of the planner has 
an analogue in the “Robot states” of the state machine. In particular, the switching state # roughly corresponds to the dual support phase of the walking state 
machine. When planning with the physical robot, the planner computes a post-impact state O by applying a velocity adjustment to the switching state. This 
empirical measure compensates for what appears to be a nearly constant decrease in velocity at every impact. This new state O represents the planner’s guess 
at the time, x, and x values immediately after the switch. The goal of the planner is ultimately to stabilize the robot, but this is implemented by choosing the 
next footstep such that velocity reverses t' seconds after the foot switch every step. For sufficiently smooth height surfaces, the relationship between the next 
footstep location px and the velocity x is monotonic, as shown in (g). We use the standard bisection algorithm to identify the next foot placement O which 
results in a velocity reversal state ♦ at time t', as shown in (f). 


expressed as the differential equation 


•• 

X = -(x — Xp). 

z 


(18) 


Accounting for z being a function of x, the height surface, 

dz 
dt 

d?z 
dt^ 


z = 


dz dx 

dx dt ’ 


(19) 

; d t dz\ dx dz d , 

rdx\ 

(20) 



dt\dxJ dt dx dt 

^ dt J 

d^z n dz 



(21) 


By plugging Equation (21 1 into Equation ( fTS] ), we obtain, 

.. g 


d z -2 I ^ ” 


(x - Xp), 


( 22 ) 


(5 + + (a; - Xp) 


dz . 
dx 


(23) 


9+B- 




(x - Xp) 


(24) 


which now lacks any z term, and can be used in Eig. to 
integrate forward in time. 

This model is a close approximate to the zero dynamics of 
our robot when a specific set of WBOSC tasks are accurately 
maintained. These tasks are: (1) a center of mass height task, 
(2) a constant body link attitude, and (3) any sufficiently 


gentle Cartesian trajectory task for swing foot. As illustrated 
in Eig. 1^ the planner begins calculating the landing location 
when 80% of the lifting phase is reached in the robot state 
machine’s progression. As such, the planner continuously re¬ 
plans in an online fashion to correct for trajectory deviations. 
Using the current estimate of the center of mass velocity and 
position, it numerically integrates forwards in time to predict 
its COM position and velocity when its stance foot and swing 
foot will switch roles. This time, position, and velocity is 
known as the switching state, O in Eig. Yet this is not 
the state from which the planner begins predicting the COM 
behavior for the upcoming swing. Instead, the planner applies 
a subtle modification to the switching state which represents 
the effect of landing. After applying this model, we arrive at 
the “post-impact state”, O in Eig. This point represents the 
planner’s best guess at the center of mass position and velocity 
immediately after the leg switch. 

The implementation of the planner enforces the choice of 
a value for the reversing time, t'. This time value remains 
constant for every step, thus the user needs only to specify a 
single parameter for the planner to operate. As of now, t' is 
manually chosen and as we show in the simulations it is able 
to stabilize the biped for an arbitrary long number of steps. 
The planner then finds and returns the footstep location which 
causes the robot’s COM velocity to reach zero, t' seconds 
after the foot switch, starting from the post-impact state. 
Eor each potential footstep location considered, the planner 
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■III Actual Path 
■Hill Planned Path 
O Planned Foot Placement 
-f Actual Landing Location 
^ Stance Foot Location 


Fig. 5. 3D Stabilization in Simulation. Subfig. (a) displays three steps of walking from forward COM phase space perspective, while (b) displays the 
lateral COM phase portrait. In both figures the smaller axes highlight the planned versus actual trajectory for each of the three steps. The steps ai‘e shown 
in (c) using the simulation graphics provided by the SrLib Multi-Body Dynamic Simulation Environment. Subfig. (d) shows a longer time period of walking 
from several angles, and demonstrates the generally stationary behavior. 


integrates forward in time starting from the post-impact state 
as suggested in Fig. [^, returning the velocity after t' seconds. 
This integration can be viewed as a function mapping footstep 
location to a future velocity, and it is this function over which 
we search for a zero crossing via bisection. Since we use 
bisection, the number of integrations actually performed is 
very low, however the process relies on the monotonicity of the 
relationship between footstep location and the velocity after t' 
seconds of integration. If the height surface is planar, then this 
relationship is linear. 

2 ) 2D Velocity Reversal for 3D walking: Choosing a foot¬ 
step for 3D walking is, under certain circumstances, identical 
to choosing the footstep for 2D walking in two orthogonal 
directions. We take advantage of this interpretation to extend 
our constant time to velocity reversal planner to 3D, as we 
split 3D walking to a forward, x, phase space and a lateral, 
leftwards or y, phase space. While we could potentially allow 
different t' parameters for the two planes, we use t' = 0.24 
seconds for our particular robot in both cases. This short time 
constant is indicative of the highly dynamic motion we are 
attempting, and is chosen to be as short as possible since faster 
stepping allows disturbances to be counteracted sooner. 

Since an attempt to cross feet laterally could result in a 
leg-leg collision, we slightly modify the procedure in order to 
reduce the likelihood of such an event. The y phase space and 
the y component of the footstep location is always computed 
before that of x, so that the y phase space is given the 
opportunity to modify the time of swing to be used by the 
X phase space process. This is accomplished in the process 
of computing the switching state in the y phase plane. If 
the y velocity hits a maximum |r/| limit of 0.65 m/s during 
the integration, then the entire step is shortened such that 
the switching point occurs where the integration reaches the 
maximum y speed. If |y| at the default switching time is less 


than 0.1to/s, the planner extends the step duration until the 
switching state has a y speed of at least this value. When 
the y phase plane process adjusts the switching time, the x 
phase plane process uses this new switching time to find its 
switching state. Since the switching time represents the point 
at which the support leg changes, it cannot differ between the 
two directions. 

Although the y directional impact does not appear to have 
a bias, an x directional impact bias of —O.Olm/s appears in 
the simulation results and is included in the planner, though 
it is smaller than that of the experiment. 

Fig. 0 provides the results of our 3D walking algorithm 
in simulation, and demonstrates that the strategy stabilizes 
the velocity of the 3D simulation robot for an arbitrary long 
number of steps. Note that in step (1) of Fig. [^the planner 
attempts to reverse velocity in x and yet the velocity remains 
positive after the step has been made. This is due in part to 
the inaccurate landing of the footstep (1), which was not far 
enough forward to reverse the motion. This is a fairly common 
problem, especially at low speeds when the footsteps are very 
close together. 

VI. Experimental Results and Assessment 

We present here experimental results supporting the ideas 
proposed in previous sections. A planarizing linkage system, 
shown in Fig. constrains motion to the sagittal plane in 
all experiments. In the first experiment, Hume stands on 
two wedges inclined inwards as shown in Fig. |^a). The 
first experiment shows balancing on a split terrain robustly 
handling human interactions. It demonstrates a successful 
implementation of WBOSC in a biped robot with elastic 
actuators, and validates the performance of internal force con¬ 
trol. In the second experiment, Hume implements a stepping 
task with the planarizer’s slider locked in place, allowing 
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(a) Balancing on Steep Basin 



(b) CoM Tracking 



x{rn) 


(c) Recovery from Human Disturbance 



Fig. 6. Elliptical Trajectory Tracking and Human Disturbance Rejection on High Pitch Split Terrain: Subfigure (a) shows Hume standing between two 
inclined wooden panels and tracking a position task with its center of mass. This position set-point follows a constant velocity trajectory along an elliptical 
path shown, along with the measured COM path, in subfigure (b). In Subfigure (c) the COM has a different impedance in the horizontal and vertical directions. 
When the robot is pushed backwards it moves as though the center of mass were connected to a low-spring-constant spring, whereas when the robot is pushed 
downwards it reacts as though connected to its set point by a far stiffer spring. Due to the feedback regulation of internal forces, the biped does not fall down 
when disturbed with large external forces. 


only vertical motion, approximately, and pitching of the body. 
We use the stepping test to validate the contact transition 
technique introduced in Sec. III-F The stepping test also tests 
the DSP-level gain scheduling strategy we used. In the final 
experiment, we show undirected dynamic walking with the 
proposed continuous re-planning method. As a final comment 
before describing further details, we note that in all controllers 
used in the experiments we omit the implementation of 
Coriolis/centrifugal terms represented by the symbols 
in Equation ([T5]l and /i^ in Equation ([T2li. 


A. Balance in a High Pitch Split Terrain 

In this experiment, the Hume biped balances on a high pitch 
terrain composed of two 45° wedges angled in towards the 
robot to create a convex floor profile. As far as we know, this is 
the first time a biped robot has been reported under conditions 
that strictly require internal force to remain standing. The 
robot’s tasks were to maintain a lOOA^ internal force pushing 
outwards against the two contact points, a desired impedance 
task for the center of mass point, and a desired impedance 
task for the body orientation. By controlling this internal force, 
Hume was able to avoid slipping while accurately adjusting 
its COM position. This experiment is divided into two sub¬ 
experiments; Hume was made to follow a time-varying COM 
trajectory which followed an elliptical path in the sagittal 
plane, as shown in Eig. |^b); and Hume was made to hold a 
Cartesian impedance task on the COM which had low stiffness 
horizontally and high stiffness vertically as shown in Eig.|^c). 
In the second sub-experiment the robot was perturbed by 
external disturbances in the form of human pushes. In this 
test, = [COMj,, COM^, qny, qR^y, = [ 0 ] 4 xi 

and Py = lOOAf, using the notation of Eig. 0 Gains are 
summarized in Table |I] 

Due to this feedback strategy, the errors between desired 
(black) and actual (red) internal forces are small enough to 
achieve a stable pose control. COM tracking performance is 


Position Gain 





Dj: 

COM^ 

15.0 

0.0 

0.0 

COM^ 

200.0 

30.0 

3.0 

QRy 

150.0 

15.0 

7.0 

QRx 

250.0 

15.0 

1.0 

Torque and Internal Force Gain 

Kp,r 

,tau 


Kf 

50 

0 


1.0 


TABLE I 

Gain Set for the Internal Foree Test. 


Deficient Internal Force Control (10 N) 

Hume follows Hume loses balance Hume totally 

an elliptical path ^ and left leg slides falls down 



Fig. 7. Deficient Internal Force Control: This figure demonstrates how 
Hume falls when the internal force is insufficient to maintain static friction 
at both feet. In this case the internal force was only 10 N which was not 
sufficient to overcome the effect of gravitational forces on the surfaces. 
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shown in Fig.|^b). Although notable noise exists, the position 
error is bounded within 2cm. The second sub experiment 
shows that feedback control of internal forces allows them 
to persist despite external forces exerted by a human. As a 
testament to the compliance of our robot, the pushes we exert 
on the robot do not produce large deviations in the internal 
forces. Fig. |^c) shows fast decay of disturbance transient 
phase, on the order of half a second. As shown in Fig. 
without sufficient internal force the robot loses frictional 
contact with the wedges and this precipitates a fall. These 
experiments ultimately show the effectiveness of closed loop 
internal force control as applied to maintaining a frictional 
contact. 

B. Stepping Test 

1) Significance: The stepping experiment offers a prelimi¬ 
nary look at the performance of the control system with regard 
to achieving the zero dynamics of the pendulum model. It 
allows us to evaluate the performance of the transition strategy 
described in Sec. EfB 

2) Setup: In this experimental setup, Hume’s x direction of 
motion was restricted by locking the sliding degree of freedom 
within the planarizing linkage. The robot’s feet, placed roughly 
beneath the center of mass of the system, were lifted one at a 
time as though the robot were marching in place. Rather than 
using the planner, desired footstep location was held constant 
for each foot. 

3) WBOSC Inputs: This stepping in place motion followed 
a time scripted state machine, shown in Fig. |^a). Since the 
states are symmetric with respect to the supporting leg being 
either right or left, states are categorized in two different 
task sets with left and right single support having symmetric 
structures. The WBOSC task set differs between dual support 
and other phases of the stepping state machine. 

As shown in Fig. |^d), the robot’s tasks were to control 
COM height, body pitch and roll angles in dual support. 
Therefore, in dual support, is [COM^, q^y, q^x]^- In 
single support, we control foot position as well, therefore it 
changes to [COM^, q^y, qRx, footx, footy, /oof^]^. Here, 
internal force feedback control is disabled due to the brief dual 
support, 0.02s. Transitions are used based on Sec. |III-F| with 
^tlsk = 0> 0^wfr,x, wfr,y, wfr^zV, where W e [0,1] is 

the scaling factor and fr- is the expected reaction force in 
the dual support case. 

4) Data Analysis & Conclusions: The stepping task also 
used the gain scheduling algorithm described in Sec. D 
With a positional COM error bounded within 2cm and an 
orientation error within Q.15rad, as can be verified in Fig.j^d), 
we can conclude that the controller has the potential to achieve 
closed loop standing through repeated stepping. Torque track¬ 
ing performance for the low level controllers is displayed in 
Fig. [^b). The results show larger torque tracking errors in 
the single supporting leg than in the swing leg. When the 
gain scheduling algorithm enables the integral gain on torque 
tracking, during swing, this error becomes far smaller, which 
is necessary in order to overcome the friction internal to the 
actuator when moving the lightweight shank link of the swing 
leg. 


Position Gain in Dual Support 



Kx 


Ix 

Dx 

COM^ 

450.0 


55.0 

5.0 

<lRy 

400.0 


55.0 

5.0 

QRx 

80.0 


10.0 

5.0 

Position Gain 

1 in Single Support 



Kx 


Ix 

Dx 

COM^ 

470.0 


60.0 

10.0 

<lRy 

300.0 


60.0 

5.0 

QRx 

100.0 


10.0 

5.0 

footx 

900.0 


100.0 

100.0 

footy 

900.0 


100.0 

50.0 

footz 

905.0 


100.0 

50.0 

Default Torque Gain 



Kp,T 


Kl,r 

Every Joint 


65 


0 

Torque Gain of Swing Leg 



Kp,x 


Kl,r 

Hip right 


200 


22 

Hip left 


180 


15 

Knee right 


260 


35 

Knee left 


255 


30 


TABLE II 

Gain Set for the Stepping Test. 


Another notable result from Fig. [^c) is that the torque 
transitions gently change between each step, as a result of 
the contact transitioning procedure. Even when the command 
changes from 100 N to 0 N this transition is handled 
gracefully. To conclude, this experiment shows that by using 
torque transitions and gain scheduling the control structure 
presented in this paper, Hume achieved a smooth and accurate 
stepping motion under WBOSC. 

C. Undirected Walking with Online Re-planning Method 

The undirected walking experiment is designed to test the 
balancing ability of our system, testing both the continuous 
time feedback of our WBOSC implementation and the discrete 
time feedback from the footstep planner. In this experiment, 
Hume continually steps forward or backward in order to 
remain upright despite its inherently unstable dynamics. The 
abduction/adduction joint of the hip, unlike the other experi¬ 
ments, was fixed using joint level position control to simplify 
the problem. The experimental setup, as well as the data from 
this experiment, are shown in Fig. 

Since the abduction/adduction joint is locked, does 

not include roll motion control, and is thus [COM^, qRyY". 
In single support, it becomes [COM^, footx, /oof^]^. 

Additionally, since the dual support period is only 0.079s long, 
internal force feedback control is also disabled. Transitions are 
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(a) State Machine 

(Transition 1 Double 
^support 




^ [Left Leg 


[ Landing ] 

( Lifting ) 




- 0.95 


J. 


% 0.9 

1 0.85 







ai 





-ai 






27 27.5 

lime (!i») 


26.5 

(d) Body Pose 



0 





N -0.5 





27 

Time (see) 


(b) Foot Trajectory 


(c) Torque Trajectory 


Fig. 8. State Machine and Stepping Motion Test: (a) Motion within the stepping experiment is divided into three states: dual support, swing leg lift, 
and swing leg landing. Additionally, a transition state exists at the beginning of the lifting phase and the end of the landing phase in order to avoid the jerk 
caused by a sudden change of effective mass. Desired foot trajectories, blue, and sensor data, red, exhibit good tracking performance in Subfigure (b). In 
Subfigure (c), desired torque trajectories, blue, are plotted alongside sensor measured torques, in red. Background color indicates the phase of walking, with 
green con'esponding to the left leg swing phase, yellow to the right leg swing phase, and white to the dual support phase. In Subfigure (d), position and 
orientation task tracking is plotted over a representative portion of the stepping experiment, with sensor data in red and desired values, dotted, in blue. Height 
refers to COM height. 


implemented identically to the way they were implemented in 
the stepping test, and as described in Sec. |III-F| 

The difficulty of this challenge, when constrained to the 
sagittal plane, is highly dependent on the amount of time the 
robot spends in dual contact, which is naturally stable. We 
spend an almost trivial amount of time in dual support, limited 
primarily by the speed at which our contact transitions can 
proceed. The experimental velocity data used in Fig. are 
filtered by a steady state PIPM based observer. This filtered 
data are used only by the planner, and are not fed back by the 
WBOSC implementation. 

A critical test of our planning methodology is whether or 
not the simplified model we used to approximate the zero 
dynamics of the controlled system are an accurate predictor 
of the future center of mass state. And demonstrated by Fig.|^ 
the model predicts the continued evolution of the current 
step accurately, and the evolution of the second step with a 
much larger propensity to deviate widely. This is because the 
landing event is a major source of uncertainty. Not only is the 
controller performing a transition maneuver at this point, but 
the landing error of the foot is not insignificant relative to the 
length of an average step in both space and time. 

Noticing that the robot reacts to contact transitions with a 
consistent bias, we model a —0.1 m/s jump in velocity at each 
transition to reflect this reality. This impact model separates 
the switching state from the post-impact state in Fig. The 
five miniature phase plots in Fig verify that this simple 


methodology successfully predicts the COM^; path in the next 
step. 

In the 16th step, Hume’s foot deviates the planned location 
by 5.5 cm, and the planned future trajectory differs signif¬ 
icantly from the truth, progressing beyond the foot location 
rather than changing direction in constant time as was the 
intention of the planner. This highlights the fact that the 
dynamics of the robot are highly sensitive to footstep position 
error, and also demonstrates that while the robot’s footstep 
accuracy was very high in the stepping experiment, this accu¬ 
racy decreases as the foot is required to follow faster landing 
trajectories with more significant forward motion. However, 
this mis-step is brought back under control in the next step 
and the COM slows down considerably by the 20th step. Once 
the robot has slowed down to this degree, it becomes nearly 
impossible to predict the direction it will fall, which causes 
the robot to drift sideways until it hits the mechanical travel 
limitation of the planarizer. 

VII. Conclusion 

Operational Space Control was originally conceived for 
mobile manipulators to simultaneously control forces and 
accelerations in a dynamically consistent manner. Whole-Body 
Operational Space Control further extended this methodology 
for floating base humanoid robots with natural constraints. In 
particular, it defines the space of internal forces as a fully 
controllable task system that is orthogonal to the robot’s 
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0 0.5 0^4 oie 0.8 1 

16th Step 17tli step 



“ Actual Path in the ith step 
— Actual Path in the (i + l)th step 
_ Planned Path in the ith step 
■ Planned Path in the {i + l)th step 


O Planned Foot Placement 
4- Actual Landing Location 
Stance Foot Location 


X 


-0.4 



X 



Fig. 9. Undirectional Walking. Subfig. (a) shows the x-directional trajectory of the COM. The blue line represents actual data while the red lines indicate 
the trajectories expected by the planner. Subfig. (b) displays 23 steps of undirected walking. The central plot focuses on steps 16-21, of which 16-20 are 
expanded into individual step planning plots. In each step planning plot, a red line marks the actual COM path up to the switching state, and a green line 
continues the trajectory after the switch. The robot’s initial stance foot in the step planning plot is denoted with a the planned second footstep with a 
black circle, and the achieved second stance foot location with a blue cross. Therefore a green line in the ith step plot is the same path as the red line in the 
(i + l)th step plot. A black line and a dark green dotted line are, respectively, the PIPM model’s predicted paths before and after the switching state. 


acceleration manifold. This interpretation allows to employ 
feedback control techniques to regulate or track internal force 
reference inputs with good precision. For instance, in this 
paper we implement a proportional closed loop feedback 
control process to regulate sensor-based internal forces to a 
desired value for balancing on high pitch split surfaces. This 
case scenario exemplifies the need for a unified force/motion 
feedback control approach for this peculiar type of balancing 
which is facilitated by WBOSC. 

Point foot robots like ours cannot control their center of 
mass in single support without exploiting the state-dependence 
of the gravity and Coriolis vectors. Therefore it is difficult 
for them to implement locomotion strategies based on center 
of mass tracking such us those using the Zero Moment 
Point or the Capture Point. When we started this research 
we were motivated to attain non-periodic gaits. We were 
driven by applications such as rough terrain walking, jumping 
between vertical walls, and push recovery. At that time there 


were no algorithms suited for those type of behaviors in 
point foot robots. We devised a new rule-based algorithm, 
which we dubbed Phase-Space Planning pZ) , in which foot 
positions and apex velocities were known a priory. Phase 
space techniques were then employed to hnd transitions states 
between the steps. For this new work, we decided to extend 
phase space techniques to the general case of continuously 
stepping such that the robot’s center of mass velocity could be 
quickly reversed. The overall effect is an undirected walk that 
stabilizes the robot through continuous replanning capabilities. 
Although such walking does not necessarily stay in place, our 
experiment has two goals: to create an environment for future 
push recovery, and to create an algorithm for 3D untethered 
balancing. This rule was effective in stabilizing the robot in 
a 3D simulation as well as in the physical system with a 2D 
planarizing constraint. 

What are the advantages of calculating output joint torques 
on the SEAs versus, for instance, incorporating six axis 
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Position Gain in Dual Support 





Dx 

COMd 

350.0 

35.0 

10.0 

QRy 

150.0 

35.0 

5.0 

Position Gain 

in Single Support 





Dx 

COM^ 

350.0 

35.0 

10.0 

iRy 

150.0 

35.0 

10.0 

footx 

300.0 

50.0 

15.0 

footz 

300.0 

50.0 

10.0 

Default Torque Gain 



Kp,r 

Kpr 

Both Hip Joints 


94 

0 

Both Knee Joints 


98 

0 

Torque Gain of Swing Leg 



Kp,r 

Ki,x 

Hip right 


200 

22 

Hip left 


200 

22 

Knee right 


275 

35 

Knee left 


270 

32 


TABLE III 

Gain Set for the Undirected Walking Test. 


Phase 

Time (sec) 

Transition 

0.02 

Lifting 

0.23 

Landing 

0.26 

Double Support 

0.079 

t' 

0.25 


TABLE IV 

Time Parameters for the Undirected Walking Test 


force/torque sensors on the biped’s feet? The main advantage 
is that we can implement sensor-based feedback control behav¬ 
iors between multiple contacts on any part of the robot’s body. 
For instance, a biped robot like ours could balance using its 
knees against surfaces instead of its feet. In that case, internal 
forces between the knees can be readily calculated using the 
joint torque sensors. This capability will enable bipedal robots 
to achieve remarkable levels of agility using any part of their 
bodies. 

One of the main advantages of using WBOSC versus 
other controllers is that internal forces can be tracked with 
high fidelity based on sensor feedback. WBOSC is the first 
controller to have sensor-based feedback control capabilities 
on the internal forces. However, internal forces need to be 
computed such that the robot can balance on the steep terrain. 
We have chosen a high outward tension between the feet 
in contact such that they overcome gravity forces in our 


steep teiTain test. However, more sophisticated techniques are 
recommended to automate the balancing process. In particular, 
the multicontact/grasp model presented in ||T] provides a 
potential tool to extract internal forces given the desired center 
of mass behavior of the robot. More specifically, given desired 
center of mass behavior, we could use that model to extract 
reaction forces that fulfill friction constraints and then project 
the resulting reaction forces into the internal force manifold 
for feedback control. 

While our experimental results in this paper have focused on 
the robot under planarizing constraint, it is an aim of our group 
to recreate the 3D balancing behavior of the simulation in 
physical hardware. This may require significant performance 
improvements in the sensing and state estimation system, and 
represents a more difficult experimental procedure as well. It 
may also become necessary to upgrade the abduction motors, 
two of which burned in the process of tuning the robot for 
planer walking. 

In summary, we have presented a method to place the 
internal forces of multi-contact under feedback regulation and 
validated that this method can prevent slipping in steeply 
angular teiTains. We have verified the practicality of WBOSC 
for maintaining a zero dynamic manifold in an under-actuated 
system which permits discrete time footstep control. We have 
developed a discrete time controller which can stabilize point 
foot robots in 3D simulation, and we have shown that it can 
stabilize our physical robot in 2D. 

Appendix A 

Basics of Whole Body Operational Space Control 

Whole-Body Operational Space Control was first laid out in 
0 with a further exploration of internal forces to be published 
later in ||T]. Interested readers should refer to these sources 
for a description of the theoretical background complete with 
proofs for the concepts below, as space limits us to a cursory 
overview of WBOSC as applied to bipedal robots. Modeling 
biped poses entails representing not only the state of each 
joint Qi,..., but also the states of the robot as an 

object in space. We choose to parameterize this as a 6- 
dimensional floating joint between the world and the robot’s 
hip coordinate system with state vector qj, G M®. Combining 
the robot and base states into a single vector we arrive at 
<7 S the generalized joint vector. The 

joint torques can only directly effect the joints themselves, and 
not the floating base dynamics, so we say an under-actuation 
matrix U G ]^("’dofs-6) xridofe selects from the joint vector to 
the subspace of actuated joints. 

gact = u q, (25) 

with (/act G being the robot’s actuated joints. The 

dynamics of the robot’s generalized joints can be described 
by a single, linear, second order differential equation 

Aq + b + g = C/^Tcontrol (26) 

where {A,b,g} represent the inertia matrix. 
Coriolis/centrifugal forces, and gravitational forces, 
respectively while Tcontroi is the desired control command 
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applied to the output joints of the robot. Without considering 
contacts, or the subtle non-holonomic effect of angular 
momentum, joint torques would have no effect on the 
geometrically uncontrollable six dimensional subspace of the 
generalized joints: {z G = 0}. However 

we can sometimes gain the ability to control more of this 
space due to contact constraints. 

We consider two contact cases for point-foot biped robots: 
single support in which one foot is in contact, and dual 
support where the robot is supported by both feet. In the single 
support phase we describe the contact via a support Jacobian 
Js G which maps from generalized joint velocity 

to the velocity of the constrained foot point in Cartesian 
space. When considering dual contact, our support Jacobian 
represents twice as many constraints. Since this generalized 
point, either the single foot point in or the dual foot point 
in M®, is constrained, we know its acceleration must be zero. 
Substituting the constraint Jsq + JsQ = :Efoot(or feet) = 0 
and adding the associated co-state of constraint space reaction 
forces A, the dynamics become 

Aq + b + g + JJA = C/^Tcontroi, (27) 


and we can hnd q and A by solving the matrix equation 


f A Jj\ / q\ ^ /C/'^Tcontrol -b- Q 
\Js Q )V)~\ -Jsq 


(28) 


Converting to upper diagonal form via Gaussian elimination 
we hnd 


In the standard model of whole body control the matrix would 
rehect only the inertia of the rigid bodies of the robot. Yet 
in the series elastic case this is equivalent to assuming ideal 
performance: torque sensor measurements perfectly following 
the desired torque. With the lower force gains we used in the 
experiments this is less rehective of reality, and we include 
the rehected rotor inertia of the motor rotors in the mass 
matrix. This addition better rehects the state of setting the 
motor current without force feedback, but in reality the system 
is somewhere in the middle of these two extremes. Fortunately 
the behavior of the robot is not especially sensitive to these 
slight variations on the mass matrix in practice. 

With those assumptions, Whole-Body Operational Space 
Control for an operational task representation, ptask, is dehned 
by the differential kinematic equation 


Ptask — "Aask Qactt (33) 

where = JtuskUNs G R”t“kXnact contact 

consistent task Jacobian and Jtask G K"tii=kXndofs j-jjg 
unconstrained task Jacobian. The basic control structure for 
the single support phase of the biped is thus 


200111™! — Jtask^taskj 


(34) 


with Ftask being the entry point for feedback control laws to 
govern trajectories, applied forces, or combinations of the two 
in the operational space. For instance, when controlling an 
operational space position trajectory, we use the model-based 
control law 



(7 Tcontrol b g 


Js \jJ Tiontrol b T Agjsq j 


(29) 


with As = [JgA ^Jj] ^ and = A ^jjAg. Substituting 
Nj ^ I - jjA.JsA-^ to more conveniently express the 
resulting constrained dynamic equation. 


Aq + {b + g) + Jj As Jsq = {UNsf Tco„troi. (30) 


This can be viewed as constraining the dynamics to the 
dynamically consistent null-space of the constraint by dehning 
the dynamically consistent pseudo inversion operator 

X = A-^X^[XA-^X^]-^ (31) 

and observing that 

Ns=I-JsJs (32) 

is the null space projector of Js under dynamically consistent 
inversion such that JsNs = 0, NsA~^jJ = A~^Nj jJ = 0, 
and NsNs = Ng. 

The above dynamic equation is only valid if we assume that 
the robot actuators are completely rigid and that no friction 
occurs in the joints. Obviously this is not the case for our 
series-elastic biped. However, the joint level torque controllers 
in our robot are designed to make the actuators behave close 
to an ideal torque source. In that case, the model above more 
closely approximates the real dynamics. 

There are also some subtleties associated with the inertia 
matrix in this model given imperfect series elastic actuators. 


Atask — ^task^ttask + Mtask “f Ftask) (35) 

with Mtask being a desired acceleration for the operational ref¬ 
erence. are inertia, velocity-based forces, 

and gravity based forces in the operational space that can 
be found in the previous references. Under idealized perfect 
conditions, the effect of the above control command is the 
linearized closed loop dynamics, 

:7task — t/task- (36) 

In the case of double support, there appear closed loop 
effects between the legs of the robot in contact with the 
environment. Our previous work has thoroughly addressed this 
problem by creating structures to control the internal forces. 
Internal forces are dehned as those that are orthogonal to joint 
accelerations. As such, internal forces do not produce any 
movement and only contribute to force generation within the 
closed-loop formed by the contacts. Analyzing the right hand 
side of Equation ( [30| , those forces correspond to the manifold 

([/ Ns)^ Tcontrol = 0, (37) 

which rehect the cancellation of acceleration effects. There¬ 
fore, the torques that fulhll the above constraint belong to the 
null space of {U Ns), which is dehned by the projection 

L* = {l-UN sTTWs)- (38) 

The torques associated with internal forces are those that do 
not contribute to net movement, i.e., 

fcontrol — A Tint; (39) 
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where Tint is the set-point for the internal forces. Thus, 
when simultaneously controlling operational space tasks and 
internal forces we superimpose the orthogonal structures of 
Equation and ( [39] l yielding the whole-body operational 
space control command 

^control = '^task'^task + L "^Tint. (40) 

Internal forces can be physically defined as linear forces 
and mutually canceling reaction moments between pairs of 
supporting contacts. As explained in our previous works, such 
forces are expressed via the equation, 

i^int = Vkint Fr, (41) 

where Fr is the set of all reaction forces on the environment 
and Wint is a matrix containing geometric transformations 
and selection criteria to extract the internal forces. Using this 
mapping, we demonstrated that the dynamics of internal forces 
correspond to 

^int = Tint + ^int,{t} Mi Pit (42) 

where = i^L*UJ Additionally, Tint,{t} ^re in¬ 
ternal forces induced by task behavior, i.e. Fint,{t} = 

_ 'j' 

WmtJs >A*ask-^task, and pi and Pi are Coriolis/centrifugal and 
gravitational effects on the internal forces. Inverting the above 
equation, we derive the torques needed to accomplish a desired 
internal force, 

Tint = Ji\l ^^int.ref ^int,{t} + Mi "f P^ ) (43) 

- ^ 

where J*^ is the MoorePenrose left-pseudoinverse of Jj|;, also 
referred to as the reduced Jacobian of internal forces acting on 
the contact closed loops, and Fint.ref is the vector of desired 
internal forces which we use as an entry point to control 
internal forces. 
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